Search results for "Audio signal"
showing 10 items of 30 documents
2015
Visuo-auditory sensory substitution systems are augmented reality devices that translate a video stream into an audio stream in order to help the blind in daily tasks requiring visuo-spatial information. In this work, we present both a new mobile device and a transcoding method specifically designed to sonify moving objects. Frame differencing is used to extract spatial features from the video stream and two-dimensional spatial information is converted into audio cues using pitch, interaural time difference and interaural level difference. Using numerical methods, we attempt to reconstruct visuo-spatial information based on audio signals generated from various video stimuli. We show that de…
Real-time signal processing in embedded systems
2016
International audience
A Comparative Analysis of Residual Block Alternatives for End-to-End Audio Classification
2020
Residual learning is known for being a learning framework that facilitates the training of very deep neural networks. Residual blocks or units are made up of a set of stacked layers, where the inputs are added back to their outputs with the aim of creating identity mappings. In practice, such identity mappings are accomplished by means of the so-called skip or shortcut connections. However, multiple implementation alternatives arise with respect to where such skip connections are applied within the set of stacked layers making up a residual block. While residual networks for image classification using convolutional neural networks (CNNs) have been widely discussed in the literature, their a…
Self-Organizing Architectures for Digital Signal Processing
2013
Capturing and Indexing Rehearsals: The Design and Usage of a Digital Archive of Performing Arts
2015
International audience; Preserving the cultural heritage of the performing arts raises difficult and sensitive issues, as each performance is unique by nature and the juxtaposition between the performers and the audience cannot be easily recorded. In this paper, we report on an experimental research project to preserve another aspect of the performing arts—the history of their rehearsals. We have specifically designed non-intrusive video recording and on-site documentation techniques to make this process transparent to the creative crew, and have developed a complete workflow to publish the recorded video data and their corresponding meta-data online as Open Data using state-of-the-art audi…
On the relations between audio features and room acoustic parameters of auralizations
2013
The usual parameters in room acoustics are used to quantify the acoustic characteristics of rooms and their relation to the subjective perception of transmitted signals. Audio features (calculated with MIRToolbox) have been designed to study the relationships between the characteristics of musical audio files and their subjective perception. Both musical characteristics and acoustic parameters are oriented towards acoustic perception. By using auralizations with calibrated models of auditoriums and tools from the MIRtoolbox it is possible to jointly work with the calculation of audio features and room parameters. In this work, the statistical correlations between C80, STI, D50, EDT, RT and …
Modeling musical attributes to characterize ensemble recordings using rhythmic audio features
2011
In this paper, we present the results of a pre-study on music performance analysis of ensemble music. Our aim is to implement a music classification system for the description of live recordings, for instance to help musicologist and musicians to analyze improvised ensemble performances. The main problem we deal with is the extraction of a suitable set of audio features from the recorded instrument tracks. Our approach is to extract rhythm-related audio features and to apply them for regression-based modeling of eight more general musical attributes. The model based on Partial Least-Squares Regression without preceding Principal Component Analysis performed best for all of the eight attribu…
Spectral estimators for Doppler analysis of intracoronary ultrasound catheters
2002
With the zero-cross-detection method (ZCD) it has been shown that it is not possible to achieve a reproducible quantitative, and robust evaluation of an inter-coronary audio signal. The authors define spectral estimators to analyze the Doppler-audio signal. Measurements in a blood flow model have shown that the ZCD method underestimates the expected velocity at all speeds. Spectral analysis allows the determination of the actual and peak velocity more robustly and precisely. >
Decoding Children's Social Behavior
2013
We introduce a new problem domain for activity recognition: the analysis of children's social and communicative behaviors based on video and audio data. We specifically target interactions between children aged 1-2 years and an adult. Such interactions arise naturally in the diagnosis and treatment of developmental disorders such as autism. We introduce a new publicly-available dataset containing over 160 sessions of a 3-5 minute child-adult interaction. In each session, the adult examiner followed a semi-structured play interaction protocol which was designed to elicit a broad range of social behaviors. We identify the key technical challenges in analyzing these behaviors, and describe met…
Video preprocessing for audiovisual indexing
2003
We address the problem of detecting shots of subjects that are interviewed in news sequences. This is useful since usually these kinds of scenes contain important and reusable information that can be used for other news programs. In a previous paper, we presented a technique based on a priori knowledge of the editing techniques used in news sequences which allowed a fast search of news stories (see Albiol, A. et al., 3rd Int. Conf. on Audio and Video-based Biometric Person Authentication, p.366-71, 2001). We now present a new shot descriptor technique which improves the previous search results by using a simple, yet efficient, algorithm, based on the information contained in consecutive fra…